Detection of gene-gene interactions using multistage sparse and low-rank regression.

نویسندگان

  • Hung Hung
  • Yu-Ting Lin
  • Penweng Chen
  • Chen-Chien Wang
  • Su-Yun Huang
  • Jung-Ying Tzeng
چکیده

Finding an efficient and computationally feasible approach to deal with the curse of high-dimensionality is a daunting challenge faced by modern biological science. The problem becomes even more severe when the interactions are the research focus. To improve the performance of statistical analyses, we propose a sparse and low-rank (SLR) screening based on the combination of a low-rank interaction model and the Lasso screening. SLR models the interaction effects using a low-rank matrix to achieve parsimonious parametrization. The low-rank model increases the efficiency of statistical inference and, hence, SLR screening is able to more accurately detect gene-gene interactions than conventional methods. Incorporation of SLR screening into the Screen-and-Clean approach (Wasserman and Roeder, 2009; Wu et al., 2010) is also discussed, which suffers less penalty from Boferroni correction, and is able to assign p-values for the identified variables in high-dimensional model. We apply the proposed screening procedure to the Warfarin dosage study and the CoLaus study. The results suggest that the new procedure can identify main and interaction effects that would have been omitted by conventional screening methods.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mammalian Eye Gene Expression Using Support Vector Regression to Evaluate a Strategy for Detecting Human Eye Disease

Background and purpose: Machine learning is a class of modern and strong tools that can solve many important problems that nowadays humans may be faced with. Support vector regression (SVR) is a way to build a regression model which is an incredible member of the machine learning family. SVR has been proven to be an effective tool in real-value function estimation. As a supervised-learning appr...

متن کامل

Gene Identification from Microarray Data for Diagnosis of Acute Myeloid and Lymphoblastic Leukemia Using a Sparse Gene Selection Method

Background: Microarray experiments can simultaneously determine the expression of thousands of genes. Identification of potential genes from microarray data for diagnosis of cancer is important. This study aimed to identify genes for the diagnosis of acute myeloid and lymphoblastic leukemia using a sparse feature selection method. Materials and Methods: In this descriptive study, the expressio...

متن کامل

Differential Expression of Alpha S1 Casein and Beta-Lactoglobulin Genes at Different Physiological stages of the Adani Goats Mammary Glands

Background: Milk proteins genes have been the focus of the researches as the candidate target genes that play a decisive role when animal breeding is desired.Objectives: In the present study, the transcriptional levels of Beta-lactoglobulin (BLG) and Alpha S1 casein (CSN1S1) genes were investigated during prenatal, milking and drying times in mammary glands of the Adani goats which showed...

متن کامل

Spike and Slab Gene Selection for Multigroup Microarray Data

DNA microarrays can provide insight into genetic changes that characterize different stages of a disease process. Accurate identification of these changes has significant therapeutic and diagnostic implications. Statistical analysis for multistage (multigroup) data is challenging, however. ANOVA-based extensions of two-sample Z-tests, a popular method for detecting differentially expressed gene...

متن کامل

Estimation of Genetic Components and Inheritance of Bread Wheat Agronomic Traits Using Regression Method Through Generation Mean Analysis

Studying the genetic structure of crops, including wheat, has always been one of the research priorities to increase the efficiency of breeding methods. In order to genetic analysis of some agronomic traits of bread wheat using generation mean analysis (GMA), all produced generations along with relevant parents of the two populations (Marvdasht × Rasoul and Marvdasht × Shahpasand) were evaluate...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Biometrics

دوره 72 1  شماره 

صفحات  -

تاریخ انتشار 2016